Similar resources
16-899C ACRL Tetris Reinforcement Learner
Our approach to this problem was to use reinforcement learning with a function approximator to approximate the state value function [RSS98]. In our case, a +1 reward was given for every completed line, so that the value function would encode the long-term number of lines that is going to be completed by the algorithm. In order to achieve this, we extract features from the game state, and use gr...
Rules of arbitration in force as from 1 January 1998 and rules of conciliation in force as from 1 January 1988
Trr 1988
Transportation Research Record: Journal of the Transportation Research Board, No. 1988, Transportation Research Board of the National Academies, Washington, D.C., 2006, pp. 63–66. The analytical procedures for all-way stop-controlled intersections in the 2000 edition of the Highway Capacity Manual (HCM) lack a model to estimate the 95th percentile queue length. This is considered a major shortc...
Journal
Journal title: College & Research Libraries News
Year: 2020
ISSN: 2150-6698,0099-0086
DOI: 10.5860/crln.49.3.139